List of AI News about AI evaluation standards
Time | Details |
---|---|
2025-08-05 17:26 |
OpenAI Study: Adversarial Fine-Tuning of gpt-oss-120b Reveals Limits in Achieving High Capability for Open-Weight AI Models
According to OpenAI (@OpenAI), an adversarial fine-tuning experiment on the open-weight large language model gpt-oss-120b showed that, even after robust fine-tuning, the model did not reach High capability under OpenAI's Preparedness Framework. External experts reviewed the methodology. The result bears on safety and evaluation standards for open-weight AI models, and is relevant to enterprises and developers who rely on open-weight models and need to assess risk and compliance. The study highlights both the opportunities and the limitations of deploying open-weight models in enterprise and research environments (Source: openai.com/index/estimating-...). |